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R A PTC GROT JND OF THE INVENTION 

This invention relates in general to scatterometers and in particular, to 
spectroscopic scatterometric systems and methods employing models for measuring 
10 parameters of a diffracting structure as well as related applications to sample 
processing. 

As the integration and speed of microelectronic devices increase, circuit 
structures continue to shrink in dimension size and to improve in terms of profile 
edge sharpness. The state-of-the-art devices require a considerable number of 

15 process steps. It is becoming increasingly important to have an accurate 
measurement of submicron linewidth and quantitative description of the profile of 
the etched structures on a pattern wafer at each process step. Furthermore, there 
is a growing need for wafer process monitoring and close-loop control such as 
focus-exposure control in photolithography. 

20 DifFraction-based analysis techniques such as scatterometry are especially 

well suited for microelectronics metrology applications because they are 
nondestructive, sufficiently accurate, repeatable, rapid, simple and inexpensive 
relative to critical dimension-scanning electron microscopy (CD-SEM). 

Scatterometry is the angle-resolved measurement and characterization of 

25 light scattered from a structure. For structures that are periodic, incident light is 
scattered or diffracted into different orders. The angular location 6 r of the m* 



diffraction order with respect to the angle of incidence 9, is specified by the grating 
equation: 

(1) 

X 

sin^ + sin0 r = m— 

where X is the wavelength of incident light and d the period of the diffracting 
structure. Spectral scatterometry performs the above measurement using a variety 
of transmitted light that can be used for measurement of the grating parameters. 

The diffracted light pattern or spectrum from a structure can be used as a 
"fingerprint" or "signature" for identifying the dimensions of the structure itself. In 
addition to period, more specific dimensions, such as width or critical dimension 
(CD), step height (H), and the shape of the line, and angle of the side-walls (SWA), 
or other variables referred to below as parameters of the structure, can also be 
measured by analyzing the scatter pattern. 

In scatterometry, a diffraction model of the diffracting structure or grating 
is first constructed. Different grating parameters outlined above are parameterized 
and the parameter space is defined by allowing each parameter to vary over a certain 
range. A look up table is then constructed offline prior to measurements. The look 
up tables, also-called libraries, are multi-dimensional with the parameters such as 
CD, height and wall angle as the variable of each dimension. The tables contain 
typically, a collection of spectra where each spectrum is a plot of a measured 
diffraction reflectance or transmittance versus wavelength or illumination angle 
corresponding to a particular set of values of the parameters. After the sample 
spectrum is measured, it is compared to all the spectra in the look up table to find 
the best match and the value or values of the one or more parameters are then 
determined by the values at which the best match is found. 

The look up tables are multi-dimensional and need to cover a number of 
parameters extending over different ranges. The end result is a multi-dimensional 
sampling grid with each point on the grid being a spectrum that contains hundreds 
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of data points. Such tables are extremely time consuming to calculate and difficult 
to refine. If any parameter during real time measurement falls outside the sampling 
grid, or any dependent variables are different from what have been used for 
constructing the look up table, then the tables become useless and have to be 
5 reconstructed, which may take days. This drawback significantly reduces the value 
of integrated CD measurement systems, of which the main goal is to reduce the time 
delay from process to metrology results. 

It is, therefore, desirable to provide an improved technique for deriving the 
important parameters of the diffracting structure from the measured data. 

10 

SUMMARY OF THE INVENTION 

As noted above, the multi-dimensional look up table used in conventional 
scatterometry is extremely time consuming to calculate and difficult to refine. This 
invention is based on the recognition that processing delays can be much reduced by 

15 making use of knowledge of the diffraction structure to be measured. Thus, if the 
approximate values or ranges of values of the parameters are known, there is no 
need to employ a full sized look up table which contains all the data points over a 
full or maximum possible ranges of values for the parameters. In such event, one 
could make the "best guess" of the values of the parameters as a start and perform 

20 an optimal estimation process within a neighborhood of the "best guess" using 
measured data from the diffracting structure. Thus, this method involves computing 
only for more limited ranges of values for the parameters containing the initial set of 
guessed values of the parameters, which ranges of values may be smaller than those 
in the conventional method using a look up table containing data points over all 

25 possible values of the parameters. 

Preferably, the optimized estimation employs non-linear regression or 
simulated annealing. In one embodiment, the initial set of guessed values is found 
as follows. First a coarse library of sets of data related to the diffraction at different 
wavelengths is constructed where the sets of data are generated assuming 

30 corresponding sets of values of the parameters covering the maximum possible or 
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relatively large ranges of values. The diffracting structure may then be measured and 
the measured data is compared to the library to find the initial set of guessed values 
of parameters. 

To speed up the process of matching measured data from the diffracting 
5 structure to those provided by the model which includes calculation of eigenvalues, 
the processing can be simplified and made faster by storing the eigenvalues so that 
the eigenvalues will not have to be recalculated every time matching is performed for 
different diffraction structures. The eigenvalues are then used to obtain the value of 
one or more parameters of a diffracting structure from measured data from the 

10 structure. In another embodiment, a look up table of the eigenvalues may also be 
pre-computed so that any eigenvalue within the needed range may be calculated with 
interpolation from the eigenvalue look up table. This will make the calculation of 
eigenvalues easier, faster, and more reliable, which improves the most time 
consuming and least robust part of modelling. 

15 The diffracting structure measured frequently sits underneath and/or over a 

stack of one or more layers of material so that when electromagnetic radiation is 
directed at the structure to perform measurements, the measurements will be 
affected by the effects of such layers on the measurements. Therefore, where 
measurement of the structure is carried out by directing a polychromatic beam of 

20 electromagnetic radiation and detecting corresponding data of a diffraction of the 
beam at a number of wavelengths, the wavelengths at which the data on the structure 
is measured may be chosen as a function of the properties of the one or more layers. 
In one example, the wavelengths are chosen so that the measurements are less 
affected by the properties of the one or more layers. In this manner, measurement 

25 of similar structures will be less affected by the different layers in their vicinity. 

When measurement of the structure is carried out by means of a 
polychromatic beam of electromagnetic radiation and detection of corresponding 
data of the beam at a number of wavelengths, the measured data may change more 
significantly over one wavelength range than over another. Thus, to provide more 

30 accurate sampling representation of the spectra, the density of the data samples over 
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the wavelengths may be chosen as a function of sensitivity of the data to changes in 
the one or more parameters over the different wavelengths. 

To model a three-dimensional diffracting structure, a model of the structure 
may be provided by cutting a three-dimensional contour resembling a portion of the 
5 structure along planes parallel to a reference plane to obtain a pile of slabs. An array 
of rectangular blocks arranged along planes parallel to the reference plane may be 
formed to approximate each slab. An analysis such as multimodal analysis may be 
performed for each of the arrays to find a one-dimensional solution and the solutions 
of adjacent blocks are matched to find a two-dimensional solution for the array. A 

10 three-dimensional solution for the contour may then be formed from the 
two-dimensional solutions of the arrays. 

Where measurements according to any one or more of the above-described 
features are performed by directing a polychromatic beam of electromagnetic 
radiation at the diffracting structure and data is detected from the structure, 

15 preferably the data measured includes intensities or changes in polarization state of 
the diffraction of the radiation of the structure. 

Line roughness in the form of slight variations in grating line patterns may 
be present on some samples. A beam of radiation is directed towards the grating 
lines in an incident plane which is substantially perpendicular to the lines, where the 

20 radiation supplied is of a known polarization state. By measuring the change in 
polarization state caused by diffraction by the lines, a measure of the line roughness 
can be obtained. Preferably, the incident radiation is linearly polarized of S- or 
P-polarization and a cross-polarization coefficient may be measured from the 
diffracted radiation as an indication of line roughness. 

25 Any one of the above-described techniques may be used to find the value(s) 

of one or more parameters of a diffracting structure, and such value may be supplied 
to a sample processing machine, such as a stepper and/or etcher to control the 
lithographic and/or etching process in order to compensate for any errors in one or 
more of the parameters that has been discovered. The stepper and/or etcher may 
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form an integrated single tool with the system for finding the one or more 
parameters of a diffracting structure, or may be instruments separate from it. 

Any of the techniques described above may be performed by means of 
software components loaded into a computer or any other information appliance or 
digital device. When so enabled, the computer, appliance or device may then 
perform the above-described techniques to assist the finding of value(s) of the one 
or more parameters using measured data from a diffracting structure. The software 
component may be loaded from a fixed media or accessed through a communication 
medium such as the internet or any other type of computer network. 



BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1A is a schematic view of a spectroscopic scatterometer useful for 
illustrating the invention. 

Fig. IB is a cross-sectional view of a two-dimensional grating useful for 
1 5 illustrating the invention. 

Fig. 2 is a flow diagram illustrating an optimization estimation method to 
illustrate one aspect of the invention. 

Fig. 3 is a schematic view of a large course library and small local libraries 
useful for illustrating the invention. 
20 Fig. 4 is a graphical plot of reflectance spectrum of a resist grating on silicon 

to illustrate the sensitivity of the spectrum to the critical dimension (CD) and system 
noise to illustrate the invention. 

Fig. 5 is a. flow chart illustrating a method for propagating S-matrices in a 
sample including a diffraction grating, a top film stack on the grating and a bottom 
25 film stack below the grating to illustrate the invention. 

Fig. 6 is a schematic view of multi-slab model for approximating a portion 
of a grating useful for illustrating the invention. 

Fig. 7 is a graphical plot of the skin depth of polysilicon at different 
wavelengths of light. 
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Fig. 8 is a graphical plot of a reflectance spectrum of a diffraction as a 
function of wavelength to illustrate the sensitivity of the spectrum over different 
wavelengths. 

Fig. 9 is a flow diagram illustrating the construction of a model to 
5 approximate a portion of a three-dimensional diffracting structure for illustrating the 
invention. 

Fig. 10 is a schematic view of an array of rectangular blocks for 
approximating an elliptical or circular cylinder in the model of Fig. 9. 

Fig. 1 1 is a perspective view of a two-dimensional grating relative to a plane 
10 of incidence of a measurement beam to illustrate a method for measuring line 
roughness. 

Fig. 1 2 is a schematic block diagram illustrating a wafer processing apparatus 
including a stepper and an etcher and a scatterometer where parameter information 
from a diffracting structure and/or associated structures from the scatterometer is 
15 used to control the manufacturing process in the stepper and/or etcher to illustrate 
the invention. 

Fig. 13 is a block diagram showing a representative sample logic device in 
which aspects of the present invention may be embodied. 

For simplicity in description, identical components are labeled by the same 
20 numerals in this application. 

DETAILED DESCRIPTION OF THE SPECIFIC EMBODIMENTS 

Even though much of the description below of algorithms and methods are 
described in terms of the reflected or transmitted intensities of the diffraction caused 

25 by the diffracting structure, it will be understood that the same techniques and 
algorithms may be used for data containing information concerning changes in the 
polarization state over different wavelengths (e.g. ellipsometric parameters and 
as functions of wavelength). For this reason, it may be advantageous to employ an 
instrument which is capable of measuring both the reflected or transmitted intensities 

30 of the diffraction caused by the structure as well as changes in polarization state 
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caused by the diffraction of the structure. A suitable system is described below in 
reference to Fig. 1 A. 

Fig. 1A is a schematic view of a spectroscopic scatterometer system to 
illustrate the preferred embodiment of the invention. As shown in Fig. 1 A, system 
5 10 may be used to measure reflected or transmitted intensities or changes in 
polarization states of the diffraction. As shown in Fig. 1 A, a semiconductor wafer 
1 1 may comprise a silicon substrate 12, and a structure 16 thereon that may include 
a photoresist pattern on and/or over film stack(s), where the film(s) are at least 
partially light-transmissive and has a certain film thickness and refractive index (n 

10 and k, the real and imaginary components of the index). 

An XYZ stage 14 is used for moving the wafer in the horizontal XY 
directions. Stage 14 may also be used to adjust the z height of the wafer 11. A 
broadband radiation source such as white light source 22 supplies light through a 
fiber optic cable 24 which randomizes the polarization and creates a uniform light 

15 source for illuminating the wafer. Preferably, source 22 supplies electromagnetic 
radiation having wavelengths in the range of at least 1 80 to 800 nm. Upon emerging 
from fiber 24, the radiation passes through an optical illuminator that may include 
an aperture and a focusing lens or mirror (not shown). The aperture causes the 
emerging light beam to image a small area of structure 16. The light emerging from 

20 illuminator 26 is polarized by a polarizer 28 to produce a polarized sampling beam 
30 illuminating the structure 16. 

The radiation originating from sampling beam 30 that is reflected by structure 
16, passed through an analyzer 32 and to a spectrometer 34 to detect different 
spectral components of the reflected radiation, such as those in the spectrum of the 

25 radiation source 22, to obtain a signature of the structure. In one mode 
(spectrophotometry mode) of operation, the reflected intensities are then used in a 
manner described below to find the value(s) of one or more parameters of structure 
16. The system 10 can also be modified by placing the spectrometer 34 on the side 
of structure 16 opposite to illumination beam 30 to measure the intensities of 

30 radiation transmitted through structure 16 instead for the same purpose. These 
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reflected or transmitted intensity components are supplied to computer 40. 
Alternatively, the light reflected by the structure 16 is collected by lens 54, passes 
through the beam splitter 52 to a spectrometer 60. The spectral components at 
different wavelengths measured are detected and signals representing such 
5 components are supplied to computer 40. The light reflected by structure 16 may 
be supplied by source 22 through illuminator 26 as described above or through other 
optical components in another arrangement. Thus, in such arrangement, lens 23 
collects and directs radiation from source 22 to a beam splitter 52, which reflects 
part of the incoming beam towards the focus lens 54 which focuses the radiation to 

10 structure 16. The light reflected by the structure 16 is collected by lens 54, passes 
through the beam splitter 52 to a spectrometer 60. 

When the system 1 0 is operated in another mode (spectroscopic ellipsometry 
mode) used to measure the changes in polarization state caused by the diffraction by 
the structure, either the polarizer 28 or the analyzer 30 is rotated (to cause relative 

15 rotational motion between the polarizer and the analyzer) when spectrometer 34 is 
detecting the diffracted radiation from structure 16 at a plurality of wavelengths, 
such as those in the spectrum of the radiation source 22, where the rotation is 
controlled by computer 40 in a manner known to those skilled in the art. The 
diffracted intensities at different wavelengths detected are supplied to computer 40, 

20 which derives the changes in polarization state data at different wavelengths from 
the intensities in a manner known to those in the art. See for example U.S. patent 
No. 5,608,526, which is incorporated herein by reference. 

Fig. IB is a cross-sectional view of the structure 16 on substrate 12, which 
structure comprises a diffracting structure 16b situated between the film stack 16a 

25 above the structure and the film stack 16c underneath the structure and an incident 
electromagnetic beam 30 to illustrate the invention. Thus, the incident beam 30 of 
the electromagnetic radiation first encounters the interface between the air and the 
film stack 16a and interfaces that may be present within the stack. Next, the portion 
of the radiation from beam 30 that penetrates the film stack 16a is diffracted by the 

30 grating structure 16b. At least some of the radiation from beam 30 will reach the 
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film stack 16c underneath the grating and be reflected by or transmitted through 
interfaces associated with stack 16c. The total light reflectance is affected both by 
the grating and by the film stacks above and/or below the grating. Multi-layer 
interference, caused by multiple reflections between the films and the grating, creates 
5 a complicated pattern in a reflectance spectrum, which can be used for measuring 
parameters of the structure. A part of radiation from beam 30 that is not reflected 
or diffracted as described above will be transmitted into the substrate 12. As shown 
in Fig. IB, the grating 16b has a height of H, a critical dimension CD and a side wall 
angle (SWA) as indicated. 

10 Fig. 2 is a flow diagram illustrating a method for finding the value of one or 

more parameters of the grating structure 16b to illustrate an embodiment of the 
invention. As noted above, in conventional methods parameter space is created by 
allowing each of the parameters such as CD, H, SWA to vary over a wide range of 
values, such as all possible values at the step sizes dictated by precision requirement 

15 for the parameter. The predicted diffracted intensities are then calculated based on 
each set of parameter values to construct a multi-dimensional look up table. The 
intensities of the diffraction at different wavelengths from structure 16b is then 
measured using a spectrophotometer, such as the one shown in Fig. 1A. The 
measured diffracted intensities are then matched against the spectra in the look up 

20 table to find a best match. As noted above, the construction of the look up table is 
time consuming and cumbersome, since it covers a large number of values for the 
parameters. The process of matching can also be time consuming in view of the size 
of the table. 

According to one aspect of the invention, the value of the one or more 
25 parameters is found by means of an optimized estimation process instead of by 
means of the multi-dimensional look up table. Thus, if the approximate value(s) of 
the one or more parameters is known, such value may be used as the starting point 
for the optimized estimation process, or the "best guess" value(s) of the one or more 
parameters of the grating structure. Thus, a set of predicted intensity data of the 
30 diffraction at multiple wavelengths is calculated according to the "best guess" 
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value(s) of the one or more parameters of the grating structure. An optimized 
estimation process is then performed within the neighborhood of the predicted set 
of intensity of data using the measured intensities to arrive at a second value(s) of 
the one or more parameters. In one embodiment, the optimized estimation process 
5 employs nonlinear regression or simulated annealing. The above-described process 
is much faster than the conventional process using a multi-dimensional look up table. 

In some circumstances, the approximate value(s) of the one or more 
parameters may not be known beforehand. In such circumstances a coarse library 
of spectra may be constructed as follows. The values of the parameters are allowed 

10 to vary substantially over their maximum possible ranges and spectra of predicted 
diffraction intensities over multiple wavelengths are then calculated based on such 
values of the parameters, using a diffraction model 112. The measured diffraction 
intensities are then matched against the spectra in the coarse library to find a "best 
guess" spectra as the starting point of the binary sequential estimation process. 

1 5 Different from the conventional method, however, since the goal of constructing the 
coarse library is merely to find the starting point of the optimized estimation process, 
the resolution of the library can be coarse, since the accuracy of estimation does not 
depend solely upon the resolution of the library, different from the conventional 
method. 

20 The above-described process is illustrated in Fig. 2. First a coarse library of 

spectra of diffracted intensities over multiple wavelengths is constructed, each 
spectra being calculated assuming a corresponding set of values of the one or more 
parameters such as CD, H, SWA and other parameters as shown in block 102. The 
diffraction intensities from the structure are then measured as shown in block 104. 

25 The measured data 106 is then matched against the spectra in the library 102 as 
shown in block 108 to find a best match. The best match is then used as the "best 
guess" and the starting point of the optimized estimation process, such as nonlinear 
regression (block 110). The nonlinear regression process is then performed by 
varying the values of the set of parameters within the neighborhood of the "best 

30 guess" initial set of values for the parameters using a diffraction model 1 12 to arrive 
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at a second set of values (114) that is a better fit to the measured data then the "best 
guess." This process may need to be repeated for some applications to arrive at a 
good fit to the measured data. The value(s) of the one or more parameters 
corresponding to this second set then give the dimensions of the structure 16b. 
5 In the process described above in reference to Fig. 2, the data that is 

predicted or measured comprise the diffracted intensities from the grating structure. 
Instead of predicting or measuring diffracted intensities, it is also possible to find the 
value(s) of the one or more parameters by means of measuring and predicting 
changes in polarization state instead. In such event, instead of measuring the 

10 diffracted intensities using spectroreflectometer 60 (or spectrometer 34 but without 
rotating polarizer 28 or analyzer 32), SE spectrometer 34 from Fig. 1A is used 
instead to measure the change in polarization state of the diffraction to arrive at 
measured data 106 in Fig. 2. The diffraction model 1 12 is then modified to generate 
spectra of predicted changes in polarization state over multiple wavelengths as 

15 functions of different sets of values of the one or more parameters to arrive at the 
coarse library 102. 

For the method of finding the value(s) of the one or more parameters using 
change in polarization state data, analogous to the situation involving reflected or 
transmitted intensity data as indicated above, where the "best guess" in terms of 

20 changes in polarization state is known beforehand, there is no need to construct a 
coarse library of spectra 102 at all. One may then also omit the step of comparing 
the measured data to the spectra in the coarse library 102 to find the "best guess." 

Fig. 3 is a schematic view illustrating an alternative method for finding the 
value(s) of the one or more parameters of the diffracting structure. Instead of 

25 constructing a multi-dimensional look up table of high resolution, a coarse library 
102 may first be constructed as described above. The measured data 106 as shown 
in Fig. 2 is then matched against the spectra in the coarse library 102 to find the best 
fit 108. This is shown schematically in Fig. 3 as one of the sites. In some situations, 
there may be more than one spectra that fit the measured data 106, in which case 

30 there would be multiple sites, such as n sites shown in Fig. 3 where n is a positive 
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integer. Then a small fine local library or look up table may be constructed for each 
site where the parameters are allowed to vary over a small range in the neighborhood 
of each site. The measured data is then compared to the spectra in the small fine 
local library to find the best match. 
5 Any standard search algorithm, such as , for example, bi-section, can be 

applied to each local box to make local matching even more efficient. In such cases, 
not all the points of the local box are pre-calculated, only the points along the 
searching path are calculated, which reduces the amount of computing. 

Applicants have found that the measured data, such as diffraction intensities 

10 or changes in polarization state, are more sensitive at certain wavelengths of the 
radiation compared to others as shown in Fig. 4. Noise in the system, such as that 
in system 10 shown in Fig. 1A, may also be a function of wavelengths, also 
illustrated in Fig. 4. Therefore, it will be desirable to operate the above-described 
process or method using data at wavelengths where the diffraction intensity or 

1 5 change in polarization state is the most sensitive. It may also be desirable to operate 
the above-described process or method using data at wavelengths where the system 
noise is low or at a minimum. In the example shown in Fig.4, one would, therefore, 
choose to measure and calculate using a model diffraction data at wavelengths in the 
neighborhood of 410, 525 or 570 nm but would avoid diffraction intensity data 

20 above 580 nm. 

In the analysis of two-dimensional diffraction gratings, it is conventional to 
employ a stack of lamellar grating layers to approximate an arbitrary profile. See, 
for example, the article "Multilayer Model Method for Diffraction Gratings of 
Arbitrary Profile, Depth, and Permativity," by Liefeng Li, J. Opt Soc. Am. A, Vol. 

25 10, No. 12, December 1993, pages 2581-2591. The interaction between abeam of 
electromagnetic radiation and the grating is modeled using methods such as a 
multi-modal method or a rigorous coupled-wave analysis method. These methods 
involve the calculation of eigenvalues. However, in all of the methods proposed 
before this invention, the eigenvalues are calculated from scratch each time the 

30 parameters of a grating are to be determined. Since the calculation of eigenvalues 
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is time consuming and cumbersome, the methods proposed are likewise cumbersome 

and time consuming. 

Another aspect of the invention is based on the recognition that frequently 

different diffraction gratings measured may differ in only certain respects so that the 
5 data such as eigenvalues obtained with respect to portions of the gratings that are 

the same may be stored for future reference and reused, thereby saving time and 

effort in the calculation. 

A look up table of the eigenvalues may also be pre-computed, so that any 

eigenvalue within a certain range may be calculated with interpolation from the 
10 eigenvalue look up table. Any of standard interpolation routines, such as linear, or 

cubic splines, may be used for eigenvalues interpolation. This will make the 

calculation of eigenvalues easier, faster, and more reliable. 

The algorithms, including storing and subsequent re-use, and the other 

algorithms, including pre-computing look up table and subsequent interpolation, may 
15 be used also for S-matrices, as described below. 

This is illustrated in Figs. 5 and 6. In reference to Figs. IB and 5, S-matrix 

propagation is first performed in the bottom film stack 16c (block 150). Then the 

grating S-matrix is calculated using a multi-layer model containing a stack or pile of 

slabs. This is shown more clearly in Fig. 6. As shown in Fig. 6, the left side 160 of 
20 a portion of a grating line in Fig. IB is approximated in shape by four slabs, 162a, 

162b, 162c and 162d. The S-matrix is calculated for each of the four slabs. Then 

the total grating S-matrix (block 1 52) is calculated from the S-matrices for the slabs. 

Then the S-matrix is found for the top film stack 16a (block 154). 

One aspect of the invention is based on Applicants' observation that for a 
25 number of different diffraction gratings that are modeled to create a look up table, 

even though their profiles differ, the bottom portions of the gratings still may be 

essentially the same. In such circumstances, the S-matrix values of the bottom slabs 

in the stack or pile 162 may be the same for all the gratings, even though the top 

number of slabs may differ. In such circumstances, it may be useful to store the 
30 S-matrix values for all the slabs after they have been calculated for one diffracting 
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grating, so that the S-matrix data for the slabs used to model another different 
grating structure can be re-used and do not have to be recalculated. This saves time 
and effort. Thus, as shown in Fig. 6, after the grating structure with the left side 160 
has been modified, a different structure is to be modeled, and another different 
5 multi-layer model is constructed with slabs. It may happen that the bottom two slabs 
for this different grating structure turn out to be the same as slabs 162a, 162b for 
side 160, whereas the top number of slabs are different from slabs 162c and 162d. 
In such circumstances, the S-matrix values for slabs 162a, 162b already obtained in 
the modeling of the grating with left side 160 may be reused for the other different 
10 structure. 

Thus as noted above, it would be useful to store the values of the S-matrix 
of one or more of the slabs at or near the bottom of the pile 162. Where the new 
structure whose parameters are of interest differ from the one already modeled only 
in some of the slabs at the top, all one needs to do is alter the dimensions of one or 

15 more slabs at the top of the pile 162 to approximate such other new and different 
grating structure and reuse the stored S-matrices of some of the slabs at or near the 
bottom of the pile for obtaining the value(s) of the one or more parameters of the 
other diffracting structure. 

While the diffracting structure may comprise a single material, it is also 

20 possible for the structure to comprise layers of different material. The 
above-described multi-layer model accounts for the same or different kinds of 
materials in the structure. The manner in which the materials are taken into account 
is known to those skilled in the art and will not be described here. 

Where the model employed is a rigorous coupled-wave analysis model, the 

25 model also calculates eigenfunctions. The eigenfunctions may also be stored in 
addition to the eigenvalues for use in the modeling and analysis of other diffracting 
structures. 

In addition to modeling the pile of slabs 162, the model employed may also 
include the propagation of S-matrices through the bottom film stack (block 1 50) and 
30 through the top film stack (block 1 54). Where other different diffracting structures 
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to be modeled are situated over similar bottom film stacks 16c or underneath similar 
top film stacks 16a, the values of such S-matrices for the film stacks may be reused 
in the measurement of such other different grating structure, so that these matrices 
do not have to be recalculated. This saves time and effort. Therefore, according to 
5 another aspect of the invention, the values of the S-matrices for the bottom and top 
film stacks are also stored for use in finding the value(s) of the one or more 
parameters of a different diffracting structure associated with similar bottom and top 
film stacks. Obviously, the S-matrices for the top and bottom film stacks may be 
used independently of one another so that the calculation for another diffracting 

10 structure may involve only the top or bottom film stack S-matrices, but not both. 

As shown in Fig. IB, a grating structure 16b may be situated under or over 
film stacks. In some semiconductor wafers, these film stacks may vary in their 
optical characteristics across the wafer or between different wafers that have the 
same diffraction grating. In such circumstances, it may be desirable to process only 

15 data obtained at wavelengths that are less sensitive to these variations in optical 
characteristics of the film stacks. In this manner, the effect of the variations on the 
calculations would be less pronounced, which would simplify the calculations. This 
is illustrated, for example, in Fig. 7 which is a graphical plot of the skin depth of 
polysilicon versus wavelength. As shown in Fig. 7, polysilicon is substantially 

20 opaque at wavelengths from the ultraviolet range to about 3 80 nm. Therefore, if the 
intensity or change in polarization state data that are analyzed are only those for this 
wavelength range, the effect of variations in optical characteristics of the top or 
bottom film stacks on the measurements can be essentially ignored. 

Even if the underlying film is not opaque, its influence on the reflectance 

25 from the whole structure my be negligible at certain wavelengths due to multi-layer 
interference. For these wavelengths the spectrum is insensitive to film stack 
fluctuations, yet remains sensitive to grating parameters. Therefore, we can use 
reflectance at these wavelengths to measure the grating parameters, while ignoring 
fluctuations in film stacks. 
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The intensity or change in polarization state data may be more sensitive as 
a function of wavelength to the change in the value(s) of the one or more parameters 
at certain wavelengths than at other wavelengths. Another aspect of the invention 
is based on the observation that by increasing the density of data points of the 
5 intensity or change in the polarization state data at wavelengths where the data 
exhibit sharp peaks or valleys as a function of wavelength, the spectral signature 
indicated by the resulting data points may be more accurate. This is illustrated in 
Fig. 8 which is a graphical plot of diffraction intensity versus wavelength of a 
particular diffraction structure. Thus, as can be seen from Fig. 8, the change in the 

10 diffraction intensity versus wavelength is rather smooth at certain wavelengths, such 
as for wavelengths within the regions 182 and 188. There are, however, other 
regions in which the diffraction intensity changes rapidly with a small change in 
wavelength, such as within regions 184 and 186; within such regions, it would be 
desirable to increase the density of data samples to more accurately represent the 

15 shape of the curve, which would yield more accurate results in subsequent 
processing and curve fitting or data matching. 

Three-Dimensional Grating 

In semiconductor fabrication, three-dimensional diffracting structures are 
20 sometimes encountered, where the structure comprises a two-dimensional layout of 
hills on top of an underlying film stack (it may also be underneath a top film stack). 
Structure 200 illustrates one period of the grating. 

To model the three-dimensional grating of which structure 200 is a part, a 
pseudo-periodic solution in the grating is to be found which matches with plane 
25 waves outside the grating comprising incident and reflected waves. To obtain the 
solution, the three-dimensional structure 200 within one period is considered. In this 
way, the entire structure 200 is approximated with a pile of cylinders or slabs. 

A solution for each slab is first found, where the solution is a product of a 
vertically propagating plane wave times the horizontal two-dimensional solution of 
30 the pseudo-periodic boundary-value problem in the cross-section plane, where the 




• 



boundary condition in the y direction are shown in Fig. 10 and is set forth below as 



5 where £(i ) > / w/+1 ) is the electric field at x 9 y m+1 ; 

y 0 ? y m+ j are the first and last interfaces of the array of rectangular blocks for 
approximating a slab as described below and illustrated in Fig 1 0 as blocks 2 1 0-2 1 8; 
m is the number of rectangular blocks in the array; 
k 0y is wave number in the y direction; and 
10 d is the period in the y direction. 

As noted from Fig. 9, conical structure 200 is situated on a reference plane 
202. A model of structure 200 is provided by cutting a three-dimensional contour 
resembling a representative portion such as structure 200 of the three-dimensional 
grating along planes parallel to plane 202 to obtain a number of slices such as slice 
15 200(i) with i ranging from 1 to p, where p is a positive integer. Then each slice is 
approximated in shape by a cylindrical slab to arrive at structure 200', where the 
modeling is illustrated schematically by arrow 204 in Fig. 9. For purposes of 
illustration, only five cylinders are illustrated in structure 200'. Where structure 200 
is not conical in shape, the five slabs 200'(1),...200"(5) will not be circular cylinders 
20 but can take on other shapes in their cross-section, such as ellipses or other shapes. 
In general, structure 200 may be approximated by a contour 200' comprising a pile 
of a plurality of slabs, where each slab may take on the shape of a circular cylinder, 
elliptical cylinder or other suitable shapes. For each slab 200'(i) in contour 200', i 
ranging from 1 to p, an array of rectangular blocks is constructed to approximate the 
25 shape of the slab as illustrated in Fig. 10. As shown in Fig. 10, a particular slab 
which is substantially elliptical in shape is approximated (arrow 224) by an array of 
five rectangular blocks 210, 212, 214, 216 and 218. 

As described above, to obtain the two-dimensional solution, each slab such 
as 200'(i) is approximated by an array of rectangular blocks. In the embodiment of 
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Fig. 10, slab 2O0'(0 is approximated by five rectangular blocks 210-218. A 
one-dimensional boundary value problem for each of the five blocks 210-218 is 
found in a manner known to those skilled in the art. The one-dimensional solutions 
for neighboring blocks 210-218 are then matched to construct a two-dimensional 
5 solution for slab 200'(i) which is a product of one-dimensional solutions of the 
rectangular blocks and a plane wave propagating in the y direction with the wave 
number k y . The right value of k y is then searched to make the two-dimensional 
solution meet pseudo-periodic boundary conditions in the y direction. The 
two-dimensional solution found in this manner is a two-dimensional eigenfunction. 

10 The two-dimensional solutions found for each of the slabs is then matched with the 
plane waves outside the contour 200' to construct a three-dimensional solution for 
the contour 200'. The three dimensional solution for the whole grating is then found 
which provides the diffraction efficiencies both for the reflected and transmitted 
waves. This three-dimensional solution may be stored in a database and be provided 

15 as a datasource useful for finding a value related to one or more parameters of a 
three-dimensional diffracting structure. 

Fig. 1 1 is a perspective view of a two-dimensional grating relative to a plane 
of incidence of a measurement beam to illustrate a system for measuring line 
roughness. Because of the way semiconductor wafers are processed, sometimes line 

20 roughness on the wafer surface may result from the manufacturing processes. The 
line roughness is deviation of the two-dimensional grating from straight line shape. 
Another aspect of the invention is based on the observation that by measuring the 
effects of the lines on polarization of reflected light, a measure of the line roughness 
can be obtained. This is illustrated in Fig. 11. A wafer surface 300 has thereon an 

25 array of two-dimensional gratings 3 02, which is shown schematically to illustrate the 
concept. A beam 304 of radiation is incident on the grating 302 in the plane of 
incidence 306 which is preferably, substantially orthogonal to the grating lines 302. 
Beam 304 is polarized and has a known polarization state. Light that is diffracted 
by grating 302 is then detected using the spectroscopic ellipsometer 34 as described 

30 above or any other suitable instrument (for example, a polarizer combined with a 
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spectrometer) to measure the change in polarization state of the beam 304 caused 
by grating 302. The change in polarization state so measured may indicate 
characteristics of the grating 302. 

In one embodiment, beam 304 is linearly polarized in a direction in the plane 
5 of incidence 306 (P-polarization) or in a direction substantially normal to plane 306 
(S-polarization). The spectroscopic ellipsometer 34 is then used to measure the 
cross-polarization coefficient as a measure of the line roughness. In other words, if 
beam 304 is S-polarized, the spectroscopic ellipsometer would measure the intensity 
of P-polarization components of the reflected radiation where the ratio of the 

10 P-polarization components to the intensity of the illumination beam 304 would give 
the cross-polarization coefficient. If beam 304 is P-polarized, the spectroscopic 
ellipsometer would measure the intensity of S-polarization components of the 
reflected radiation where the ratio of the S-polarization components to the intensity 
of the illumination beam 304 would give the cross-polarization coefficient. If there 

15 is no line roughness, this coefficients would be zero. 

Fig. 1 2 is a block diagram of an integrated scatterometer, a photolithographic 
stepper and an etcher to illustrate another aspect of the invention. A layer of 
material such as photoresist is formed on the surface of a semiconductor wafer by 
means of stepper 350, where the photoresist forms a grating structure on the wafer. 

20 One or more of the CD, H, SWA and other parameters of the grating structure are 
then measured using system 10 of Fig, 1 A and one or more of the above-described 
techniques may be employed if desired to find the value(s) of the one or more 
parameters of the photoresist pattern. Such value(s) from the computer 40 are then 
fed back to stepper 350, where such information may be used to alter the 

25 lithographic process in stepper 350 to correct any errors. In semiconductor 
processing, after a layer of photoresist has been formed on the wafer, an etching 
process may be performed, such as by means of etcher 360. The layer of photoresist 
is then removed in a manner known in the art and the resulting grating structure 
made of semiconductor material on the wafer may again be measured if desired using 

30 system 10. The value(s) measured using any one or more of the above-described 
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techniques may be supplied to the etcher for altering any one of the etching 
parameters in order to correct any errors that have been found using system 10. Of 
course the results obtained by one or more of the above described techniques in 
system 10 may be used in both the stepper and the etcher, or in either the stepper or 
the etcher but not both. The stepper 350 and/or etcher 360 may form an integrated 
single tool with the system 10 for finding the one or more parameters of a diffracting 
structure, or may be separate instruments from it. 

Software Upgrades 

The invention has been described above, employing a system such as that 
shown in Fig. 1 A. While the various optical components in the system of Fig. 1A 
are used to obtain measured data from the sample, many of the other processes are 
performed by computer 40. Thus, for many systems currently being used by 
manufacturers such as semiconductor manufacturers, the computers used in the 
systems may not have the capability to perform the techniques described above. 
Thus, another aspect of the invention envisions that the software in these computers 
can be upgraded so that computer 40 can perform one or more of the above 
described different functions. Therefore, another aspect of the invention involves the 
software components that are loaded to computer 40 to perform the above-described 
functions. These functions, in conjunction with the optical components of system 
10 in Fig. 1A, provide results with the different advantages outlined above. The 
software or program components may be installed in computer 40 in a variety of 
ways. 

As will be understood in the art, the inventive software components may be 
embodied in a fixed media program component containing logic instructions and/or 
data that when loaded into an appropriately configured computing device to cause 
that device to perform according to the invention. As will be understood in the art, 
a fixed media program may be delivered to a user on a fixed media for loading in a 
users computer or a fixed media program can reside on a remote server that a user 
accesses through a communication medium in order to download a program 
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component. Thus another aspect of the invention involves transmitting, or causing 
to be transmitted, the program component to a user where the component, when 
downloaded into the user's device, can perform any one or more of the functions 
described above. 

5 Fig. 13 shows an information appliance (or digital device) that may be 

understood as a logical apparatus that can read instructions from media 417 and/or 
network port 419. Apparatus 40 can thereafter use those instructions to direct 
server or client logic, as understood in the art, to embody aspects of the invention. 
One type of logical apparatus that may embody the invention is a computer system 
10 as illustrated in 40, containing CPU 404, optional input devices 409 and 411, disk 
drives 415 and optional monitor 405. Fixed media 417 may be used to program 
such a system and may represent a disk-type optical or magnetic media, magnetic 
tape, solid state memory, etc.. One or more aspects of the invention may be 
embodied in whole or in part as software recorded on this fixed media. 

15 Communication port 419 may also be used to initially receive instructions that are 
used to program such a system to perform any one or more of the above-described 
functions and may represent any type of communication connection, such as to the 
internet or any other computer network. The instructions or program may be 
transmitted directly to a user's device or be placed on a network, such as a website 

20 of the internet to be accessible through a user's device. All such methods of making 
the program or software component available to users are known to those in the art 
and will not be described here. 

The invention also may be embodied in whole or in part within the circuitry 
of an application specific integrated circuit (ASIC) or a programmable logic device 

25 (PLD). In such a case, the invention may be embodied in a computer understandable 
descriptor language which may be used to create an ASIC or PLD that operates as 
herein described. 

While the invention has been described above by reference to various 
embodiments, it will be understood that changes and modifications may be made 
30 without departing from the scope of the invention, which is to be defined only by the 



